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Abstract 

Khoshnevisan et al. (2007) proposed a general family of estimators for population mean using 
known value of some population parameters in simple random sampling. The objective of this 
paper is to propose a family of combined-type estimators in stratified random sampling adapting 
the family of estimators proposed by Khoshnevisan et al. (2007) under non-response. The 
properties of proposed family have been discussed. We have also obtained the expressions for 
optimum sample sizes of the strata in respect to cost of the survey. Results are also supported by 
numerical analysis. 



1. Introduction 



There are several authors who have suggested estimators using some known 
population parameters of an auxiliary variable. Upadhyaya and Singh (1999) and 
Singh et al. (2007) have suggested the class of estimators in simple random 
sampling. Kadilar and Cingi (2003) adapted Upadhyaya and Singh (1999) 
estimator in stratified random sampling. Singh et al. (2008) suggested class of 
estimators using power transformation based on the estimators developed by 
Kadilar and Cingi (2003). Kadilar and Cingi (2005), Shabbir and Gupta (2005, 06) 
and Singh and Vishwakarma (2008) have suggested new ratio estimators in 
stratified sampling to improve the efficiency of the estimators. 



Khoshnevisan et al. (2007) have proposed a family of estimators for population 
mean using known values of some population parameters in simple random 
sampling (SRS), given by 



t = 



aX + b 

a(ax + b) + (1 - a)(aX + b) 
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where a * 0 and b are either real numbers or functions of known parameters of 
auxiliary variable X. Koyuncu and Kadilar (2008, 09) have proposed family of 
combined-type estimators for estimating population mean in stratified random 
sampling by adapting the estimator of Khoshnevisan et al. (2007). These authors 
assumed that there is complete response from all the sample units. It is fact in 
most of the surveys that information is usually not obtained from all the sample 
units even after callbacks. The method of sub-sampling the non-respondents 
proposed by Hansen and Hurwitz (1946) can be applied in order to adjust the 
non-response in a mail survey. 

In the next sections, we have tried to propose a family of combined-type 
estimators considering the above family of estimators in stratified random 
sampling under non-response. We have discussed the properties of proposed 
family of estimators. We have also derived the expressions for optimum sample 
sizes of the strata in respect to cost of the survey. 

2. Sampling Strategies and Estimation Procedure 

Let us consider a population consisting of N units divided into £ strata. Let the 

size of /"'stratum is N it (/ = 1,2, ,k). We decide to select a sample of 

size n from the entire population in such a way that n t units are selected from the 

k 

Nj units in the /"' stratum. Thus, we have = n . Let Y and X be the study and 

i = 1 

auxiliary characteristics respectively with respective population mean TandX . It 
is considered that the non-response is detected on study variable Y only and 
auxiliary variable X is free from non-response. 



Let y t be the unbiased estimator of population mean Y , for the /"'stratum, given 

b Y 

-*_ n iiynil +n i 2 yui 2 (21) 

1 nj 

where y m] and y m2 are the means based on n a units of response group and u i2 
units of sub-sample of non-response group respectively in the sample for the 
/"'stratum. be the unbiased estimator of population mean X t , based on n i 
sample units in the /"'stratum. 



Using Hansen-Hurwitz technique, an unbiased estimator of population mean Y is 
given by 

* k * 

y s t = SPiyi ( 2 - 2 ) 

i=i 



and the variance of the estimator is given by the following expression 



_* k ( i 

v(y st )=I - 

i=iv n 



1 \ 2 e 2 £ (ki -1) w 2 C 2 

PiS yi + I— — w i2Pi s yi2 

i=l 11 i 



N 



(2.3) 



48 
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where S yi and S 2 yi2 are respectively the mean-square errors of entire group and 
non-response group of study variable in the population for the /"'stratum. 

k t = — , Pi=— and W i2 = Non-response rate of the /"’stratum in the 

u, 2 N 

population = 

N t 



2.1 Proposed Estimators 

Motivated by Khoshnevisan et al. (2007), we propose a family of combined-type 
estimators of population mean Y , given by 



_ * 

T C = Yst 



aX + b 

a(ax s t + b) + (1 - a)(aX + b) 



( 2 . 1 . 1 ) 



_ k _ 

where x st = XPi x i (unbiased for X ) 

i=i 

k 

and X= XPiXi. 

i=i 

Obviously, T c is biased. The bias and MSE can be obtained on using large 
sample approximations: 

y„ =^( 1 + e o) ; x« =x(i+e 1 ) 

such that E(e 0 ) = E(e { ) = 0 and 



VI y 



St 



1 k 



E l e oJ=^ = ^5>i 



Y 



2 

Y“ i=l 



TSi + — — W i2 SY i2 



L i^Yi 



n; 



( e 2)_ v(xst) _ 1 y 2 fs 2 
l e l/“ —2 - — 2^Pi r i^xi 
X 



Cov 



E( e o e i) = 



where /. = 



X 


2 it 




_ 


y s p 


Xst 


V 




YX 




v,.- 


n i 



= = £p, 2 f,PiS Y iSxi 

YX j = i 



N,n. 



i 2 

1 Xi 



variable in the population for the /"'stratum and p i is the correlation coefficient 
between yand X in the /"’stratum. 
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Expressing T c in terms of e l (/' = 0,l), we can write (2.1.1) as 



T c =Y(l + e 0 )[l + a^e 1 ]' g (2.1.2) 

where A = - X . 

aX + b 



Suppose \aXe\< 1 so that [l + aAe,] g is expandable. Expanding the right hand 
side of (2.1.2) up to the first order of approximation, we obtain 





- gaA-ej + 



2 1 



— gaA-egej 



(2.1.3) 



Taking expectation of both sides in (2.1 .3), we get the bias of the estimator T c as 

k ~ r g(g + l) 



B(T c ) = = ZfiP , 2 

* i=l 



-a“A,“R“S^ — ctA,gRpjS YjSxi 



(2.1.4) 



Squaring both sides of (2.1.3) and then taking expectation, we get the MSE of 
the estimator^ , up to the first order approximation, as 



(ki-0 



MSE(T c )=2;f,pf[s-i +a-X-g-R-S 2 i -2oagRpjS Y iS xi +Ip? 

i=l i=l n 



WmS 



i2° Yi2 

(2.1.5) 



Optimum choice of a 

On minimizing MSE(T C ) w.r.t. a 

5a i=1 



, we get the optimum value of a as 

p 2 SJ i - 2 XgRZf i p 2 pS Y iSxi =0 

i=l 



Z f iP , 2 Pi S Yi S Xi 

=> a (opt)= i = 1 ^ 

^gRZfipfsIi 

i=l 

Thus a iopt] is the value of a at which MSE(T C ) would attain its minimum. 



3. Optimum n i with respect to Cost of the Survey 

Let C i0 be the cost per unit of selecting n i units, C n be the cost per unit in 
enumerating n n units and C i2 be the cost per unit of enumerating u i2 units. Then 
the total cost for the i th stratum is given by 

Cj = CjQiii + Cjjiijj + Cj2 u i2 V / = 1,2,..., A: 
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Now, we consider the average cost per stratum 

W i2 



E(C,)=n, 



CiO+CiiW ll+ C i2 



k; 



Thus the total cost over all the strata is given by 



Co = IE(Ci) 

i=l 



k 

= 5>i 

i=l 



C i0 +C il W il +C; 



i2 



Wi2 

k; 



Let us consider the function 

(j) = MSE(T C )+ pCg 



(3-1) 



(3-2) 



where /u is Lagrangian multiplier. Differentiating the equation (3.2) with respect 
to n t and k t separately and equating to zero, we get the following normal 
equations. 



Pi 



<9nj 



^Yi +a 2 X 2 g 2 R 2 SJi -2aXgRp i S Yl S Xl J-2i-(k i -l)w l 2 S 2 ; 

n. 



n. 



Yi2 



+ F 



CiO +C il W il +C; 



W: 



i2 



i2 



k; 



= 0 



a».pfw, 2 s^ ^ =Q 



5k; 



k 



From the equations (3.3) and (3.4) respectively, we have 



n i =■ 



Pi V s vi + a 2 X 2 g 2 R 2 S 2 -2a^gRpiS Yi Sxi +(ki -l)W i2 S 2 



Yi2 



VpJCio + CijWi! +c i2 



Wi2 

k; 



and 

a/p = 



k iPi S Yi2 
n i V^'i2 



(3.3) 

(3.4) 



(3.5) 



(3.6) 



Putting the value of the yfju from equation (3.6) into the equation (3.5), we get 

< 3 - 7 > 

^Yi2 A i 

Where Aj = ^jc i0 +C il W il 



and Bj = +a 2 X 2 g 2 R 2 s| i -2aXgRpiS Y iSxi -W i2 S 2 i2 
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Substituting k l(opt} from equation (3.7) into equation (3.5), n, can be expressed as 

„ L 2 , (^B,W i2 SY i2 ) 

Pi f ■ + a: 

n i - V ' (3.8) 



The -JJ/ ln terms of total cost C 0 can be obtained by putting the values of k lim) 
and n i from equations (3.7) and (3.8) respectively into equation (3.1) 




Now we can express n i in terms of total cost C 0 



n i(opt) 



Co Pl 1 


„2 , (VC^ B i W i2S Y i 2 ) 


i A- 

A i 


ZPi( A i B i + V C i2 W i2 S Yi2) 

i=l \ 


^2 ! V C i2 A i W i2 S Yi2 
1 B ! 



(3.10) 



Thus n l(opt) can be obtained by equation (3.10) by putting different values of 

W i2 and k r 



4. Numerical Analysis 

For numerical analysis we have used data considered by Koyuncu and Kadilar 
(2008). The data concerning the number of teachers as study variable and the 
number of students as auxiliary variable in both primary and secondary school for 
923 districts at 6 regions (as 1 : Marmara, 2: Agean, 3: Mediterranean, 4: Central 
Anatolia, 5: Black Sea, 6: East and Southeast Anatolia) in Turkey in 2007 
(Source: Ministry of Education Republic of Turkey). Details are given below: 



Table No.4.1: Stratum means, Mean Square Errors and Correlation 
Coefficients S m 



Stratum 

No. 


A , 


n i 


Yi 


X,- 


s Yi 




s XYi 


Pi 


s Yi2 


1 


127 


31 


703.74 


20804.59 


883.835 


30486.751 


25237153.52 


.936 


440 


2 


117 


21 


413.00 


9211.79 


644.922 


15180.769 


9747942.85 


.996 


200 


3 


103 


29 


573.17 


14309.30 


1033.467 


27549.697 


28294397.04 


.994 


400 


4 


170 


38 


424.66 


9478.85 


810.585 


18218.931 


14523885.53 


.983 


405 


5 


205 


22 


527.03 


5569.95 


403.654 


8497.776 


3393591.75 


.989 


180 


6 


201 


39 


393.84 


12997.59 


711.723 


23094.141 


15864573.97 


.965 


300 
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Table No.4.2: % Relative efficiency (R.E.) of T c w.r. to y st at a iopt) , a =l,b = \ 



w, 2 




r.e.{t c ) 


0.1 


2.0 


914.25 


2.5 


834.05 


3.0 


768.23 


3.5 


713.25 


0.2 


2.0 


768.23 


2.5 


666.62 


3.0 


591.84 


3.5 


534.49 


0.3 


2.0 


666.62 


2.5 


561.39 


3.0 


489.12 


3.5 


436.42 


0.4 


2.0 


591.84 


2.5 


489.12 


3.0 


421.89 


3.5 


374.47 



5. Conclusion 

We have proposed a family of estimators in stratified sampling using an auxiliary 
variable in the presence of non-response on study variable. We have also 
derived the expressions for optimum sample sizes in respect to cost of the 

survey. Table 4.2 reveals that the proposed estimator T c has greater precision 

— * 

than the usual estimator y st under non-response. 



References 

1. Hansen, M. H., Hurwitz, W. N. (1946): The problem of non-response in 
sample surveys. Journal of American Statistical Association, 41 , 517-529. 

2. Kadilar, C., Cingi, H. (2005): A new estimator in stratified random 
sampling. Communication in Statistics Theory and Methods, 34, 597-602. 

3. Kadilar, C., Cingi, H. (2003): Ratio estimator in stratified sampling. 
Biometrical Journal, 45, 218-225. 

4. Khoshnevisan, M., Singh, R., Chauhan, P., Sawan, N., Smarandache, F. 
(2007): A general family of estimators for estimating population mean 
using known value of some population parameter(s). Far East Journal of 
Theoretical Statistics, 22, 181-191. 



Pak.j.stat.oper.res. Vol.V No.1 2009 pp47-54 



53 





Manoj K. Chaudhary, Rajesh Singh, Rakesh K. Shukla, Mukesh Kumar, Florentin Smarandache 

5. Koyuncu, N., Kadilar, C. (2008): Ratio and product estimators in stratified 
random sampling. Journal of Statistical Planning and Inference, 139, 8, 
2552-2558. 

6. Koyuncu, N., Kadilar, C. (2009): Family of estimators of population mean 
using two auxiliary variables in stratified random sampling. 
Communication in Statistics Theory and Methods, 38:14, 2398-2417. 

7. Shabbir, J. Gupta, S. (2005): Improved ratio estimators in stratified 
sampling. American Journal of Mathematical and Management Sciences, 
25, 293-311 

8. Shabbir, J. Gupta, S. (2006): A new estimator of population mean in 
stratified sampling. Communication in Statistics Theory and Methods, 35, 
1201-1209. 

9. Singh, H., P., Tailor, R. Singh S. and Kim, J. M. (2008): A modified 
estimator of population mean using power transformation. Statistical 
papers, Vol-49, No.1, 37-58. 

10. Singh, H., P., Vishwakarma, G. K. (2008): A family of estimators of 
population mean using auxiliary information in stratified sampling. 
Communication in Statistics Theory and Methods, 37(7), 1038-1050. 

11. Singh, R., Cauhan, P., Sawan, N. and Smarandache, F. (2007): Auxiliary 
information and a priori values in construction of improved estimators. 
Renaissance High press. 

12. Upadhyaya, L.N., Singh, H.P. (1999): Use of transformed auxiliary 
variable in estimating the finite population mean. Biometrical Journal. 41, 
627-636. 



54 



Pak.j.stat.oper.res. Vol.V No.1 2009 pp47-54 




